Entropies and combinatorics of random branching processes and context-free languages
نویسندگان
چکیده
AWract-The entropies and combinatorics of trees that branch according to fixed but finite numbers of rules are studied. Context-free grammars are used to categorize the ways in which nodes branch to yield daughter nodes, thus providing an organized setting to examine the entropies for random branching processes whose realizations are trees and whose probabilities are determined by probabilities associated to the substitution rules of the grammar. Normalized entropy rates H are derived for the critical branching rate (p = 1) and supercritical branching rate (p > 1) processes. An equipartition theorem is proven for the supercritical processes proving that L-generation trees normalized by their number of nodes have log probability converging to the entropy rate H with L, almost everywhere in the nonextinction set. A strong departure from classical theorems for Markov sources occurs for super-critical branching processes p > 1 as the typical sets have super-geometric growth rates. Defining the a-typical set of trees to be the L-generation trees with log of their negative log probability within 6 of log p, then the typical set has probability equaling the nonextinction probability and log growth rate of pL. The combinatorics of the set of all trees that can be generated from the context-free substitution rules is also studied. It is proven that for all context-free grammars that are strongly connected and have at least one substitution rule with two daughters or more, the combinatoric growth rate of the set of trees is also supergeometric and equals the largest growth rate of any random branching process with the same substitution rules. Instances of regular, pseudo-linear and context-free grammars are studied for demonstrating the theory, and as a particular example it is shown that the arithmetic expression language has log-number of unique L-generation programs growing at a rate 1.75488L.
منابع مشابه
Complexity Measures and Cellular
A bstract . Various observables meas urin g the complexi ty of an ensemble of pattern s are discussed , in par ticul ar statistical quantit ies rela ted to the convergence of block entropies, an d comp utation t heoret ical quanti ties related to a gram matical description of t he ensem ble. T hese measures of com plexity are applied to one-dime nsio nal cellular automata, by char acterizing th...
متن کاملOn the Length of the Wadge Hierarchy of Omega-Context Free Languages
We prove in this paper that the length of the Wadge hierarchy of ω-context free languages is greater than the Cantor ordinal εω, which is the ω th fixed point of the ordinal exponentiation of base ω. We show also that there exist some Σ0ω-complete ω-context free languages, improving previous results on ω-context free languages and the Borel hierarchy.
متن کاملConjunctive Grammars
This paper introduces a class of formal grammars made up by augmenting the formalism of context-free grammars with an explicit set-theoretic intersection operation. It is shown that conjunctive grammars can generate some important non-contextfree language constructs, including those not in the intersection closure of context-free languages, and that they can provide very succinct descriptions o...
متن کاملStatistical Properties of Probabilistic Context-Free Grammars
We prove a number of useful results about probabilistic context-free grammars (PCFGs) and their Gibbs representations. We present a method, called the relative weighted frequency method, to assign production probabilities that impose proper PCFG distributions on finite parses. We demonstrate that these distributions have finite entropies. In addition, under the distributions, sizes of parses ha...
متن کاملCentral Limit Theorem in Multitype Branching Random Walk
A discrete time multitype (p-type) branching random walk on the real line R is considered. The positions of the j-type individuals in the n-th generation form a point process. The asymptotic behavior of these point processes, when the generation size tends to infinity, is studied. The central limit theorem is proved.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Information Theory
دوره 38 شماره
صفحات -
تاریخ انتشار 1992